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1 Abstract 

The theoretical description of the evolution of cooperation presented by 
Bergstrom [T] based on assortative matching with partner choice allows to 
model the population dynamics in a game of Nonrepetitive Prisoners Dilemma. 
In this paper we present a short analysis of asymmetric effects brought into 
the game by self knowledge of the participants, that is the knowledge of 
one's own strategy. Within the same conceptual framework as introduced 
by Bergstrom inclusion of selfknowledge leads to different behaviour of the 
assortativity index and changes in payoffs for different strategies. 

2 Basic Assumptions and Notation 

Since the classic works by Maynard Smith El the evolutionary game the- 
ory has been used in numerous applications, ranging from biology, through 
sociology to applied economics. One of the branches of the discipline is the 
matching theory, which deals with the situations where two separate sets of 
players have to with each other. Matching theory can describe fields as sex- 
ual pairing, job market or even routing in computer networks. The literature 
dealing with matching theory is quite rich, the introduction can be found in 
the work of Roth and Sotomayor |3j. 

One of the interesting aspects of matching is the influence of intergroup 
traits on the matching process. Suppose that there are some subsets within 
each of the two sets of players that are matched, the question whether it 
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makes sense to include these subsets in analysis of the matching, or, in other 
words, whether there is any evolutionary advantage in matching like with 
like (or vice versa) has been given the name of assortativity matching. This 
paper is a direct extension of the analysis of assortative matching presented 
by Bergstrom [T]. We would repeat a lot of the definitions and notations 
used in that paper to allow to follow the reasoning and easily observe the 
differences of the two approaches. 

Following Bergstrom we assume that our population is divided between 
two types of players, cooperators and defectors. The ratio of cooperators 
within the population is x, correspondingly for defectors it is 1 — x. 

The members of the population form pairs (match) with other members. 
Assortativity of matching results from difference between of frequencies of 
matching between one's own type and with different type. The relative fre- 
quency of matchings between various types of players depends on individuals' 
types and proportions of their types within the population. 

Let p(x) be the conditional probability that one encounters a cooperator, 
given that one is a cooperator. 

Let q(x) be the conditional probability that one encounters a cooperator, 
given that one is a defector. 

Then, as argued in [T], the fraction of all encounters between two indi- 
viduals in which a cooperator meets a defector is a;(l — p(x)). Similarly, the 
fraction of encounters in which a defector meets a cooperator is (1 — x)q(x). 
Since these are just two different ways of counting the same matchings we 
have the initial equation: 

x(l — p(x)) = (1 — x)q(x) (1) 

Bergstrom defined the index of assortativity, a(x) as a difference between 
probability of homogenous and heterogenous matches. 

a(x) = p(x) — q(x) (2) 

By simple rearangements it is possible to express p(x) and q(x) through 
x and a(x): 

q(x) = x[l — (p(x) — q(x)] = x{l — a(x)) (3) 
p(x) = a(x) + x(l — a(x)) (4) 
2.1 Basic games scenarios 

Let's define the general payoff matrix for two participants, X and Y. Each of 
them has two possible choices of behaviour, Cooperating (C) and Defecting 
(D). The payoffs are defined as: 
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Y = C 


Y = D 


X = c 


R : R 


S,T 


X = D 


T,S 


P,P 



For the games considered here we assume that T > R > P > S . 

In this paper we would be assuming that the individual players use simple 
strategies (i.e. not resorting to adaptive or time-dependant strategies, such 
as Repeated Prisoners Dilemma (RPD) games). Moreover, we would assume 
that a player would accept a match when the expected payoff is greater than 
or equal zero (assumption of rational behaviour). This means that if the 
analysis, based on the information available to the player would show that 
he is certain to lose, the player would not match. This point would become 
important when we would start discussing the assortative matching with 
partner choice. 

Bergstrom has considered two types of games: prisoners Dilemma and 
the Game of Shared Output. The analysis of assortativity applies to both 
of them, but for simplicity we would concentrate on the game of Prisoners 
Dilemma. Within this game the payoff for player X is given by: 

Both coooperate: R— b — c 

X cooperates, Y defects: S = — c (S < 0!) 

X defects, Y cooperates: T = b 

Both defect: P = 

where b is the benefit conferred and c is the cost of action, with b > c. 

3 Matching rules 

As Bergstrom has noted, when partners have choice about their partners and 
matching results from mutual consent interesting possibilities arise. This 
allows us to predict the index of assortativity. In the game of Prisoners 
Dilemma everyone would prefer to be matched with a cooperator rather 
than with a defector. In the simplest case, if the players behaviour could be 
predicted by observation with a 100% accuracy then the cooperators would 
match with cooperators only (with payoff of R) and defectors only with defec- 
tors (with payoff of P — 0). In this case, regardless of x, p(x) = 1, q(x) = 
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and, consequently a(x) = 1. With R > P the only stable population would 
be the one of all cooperators. 

Bergstrom proposed a more general model, in which the accuracy of de- 
termination of partner type of behaviour is less than perfect. The proposed 
model is based on labelling the players with an imperfect indicator of their 
type (such a s reputation based on partial information, behavioral cues or 
a psychological test p]). We introduce notation in which, for example, D c 
denotes a true defector labelled and perceived as cooperator (good cheater), 
Cc denotes true and recognizable cooperator, Cd a cooperator unfortunately 
mislabelled as defector and Do a defector unable to 'hide' his character. 

Within the population of cooperators we have a fraction a of correctly 
labelled ones (Cc) and fraction (1— a) of cooperators mislabelled as defectors 
Co- Within the population of defectors we would have a fraction (3 of cor- 
rectly labelled Dp and fraction (1 — /?) of defectors mislabelled as cooperators 
(Dc). Bergstrom proposed that labelling should be 'reasonably reliable', i.e. 
a,/3 > 1/2, we would study the whole range of < a,/3 < 1. 

Because the decisions of the players would be based on what they assume 
about the other players (the labels) rather than the real situation the outcome 
of the matching (payoff matrix) becomes less trivial. 

Bergstrom assumed a crucial simplification in partner acceptance rules, in 
which 'apparent cooperators will all be matched with apparent cooperators and 
apparent defectors would be matched with apparent defectors'. This allowed 
to calculate the index of assortativity for any x. 

The assumption above does not, however, apply to the situation when 
the players are selfknowledgeable, that is when any player 'knows' his 'true 
nature'. In such situation true cooperators - even those labelled as defectors 
— would shirk from matching with perceived defectors. This would create 
quite complicated mutual consent rules and payoff matrix which are presented 
in several tables in Appendix 1. The cases where the mutual consent is 
marked by YES* are special in the sense of additional level of selfknowledge, 
possible to be added, namely the knowledge of other's reaction to one's own 
label. For example in the case X = Cd and Y = Dq, X, knowing he looks 
like a defector might wonder why an apparent cooperator would want to 
match with him. The only rational explanation being that Y is in fact a 
true defector, X would then rationalise not to match with Y. Such in depth 
analysis of 'I know that you know that I know . . . ' is beyond the scope of 
the current analysis. 
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3.1 Calculation of relative frequencies and assortative 
index 

In this section we will compare the calculated values of p(x), q(x) and a(x) 
for both the Bergstrom model and Selfknowledge model. 
Following Bergstrom we have: 



p B (x) 
Qb(x) 



a 2 x 



+ 



(1 — a) 2 x 



clb(x) — (a + (3 — 1) 



ax + (1 - (3)(1 - x) (1 - a)x + (3(1 - x) 

(1 — j3)ax (3(1 — a)x 

ax + (1 - (3)(1 - x) + (1 - a)x + (3(1 - x) 

(1 — a)x 



ax 



+ 



ax + (1 - (3)(l - x) (1 - a)x + (3(1 - x) 
For the selfknowledge model the probabilities are: 



(5) 
(6) 

(7) 



p s (x) = 



a 2 x 



ax + (1 - (3)(l - x) 
Qs(x) = (1 - (3)x 



a s (x) 



a 2 x 



(l-(3)x 



(8) 
(9) 
(10) 



ax+ (1 - (3)(l -x) 

Figure 1 in Appendix 2 compares ob(x) and as(x) for a = 3/4 and 
(3 = 3/5. One can immediately see the difference in behaviour of a(x) in the 
region of x ~ 1. To understand the meaning of the differences it is useful to 
consider the limiting cases for both models. 



Bergstrom 


Selfknowledge 


p(x) in the limit x — > 


PB (x)*x(^ + ^) 




q(x) in the limit x — > 


q B (x) x 


q s (x) -/?) 


in the limit x — > 1 


Pb(x) ~ 1 — (1 — x) 


p 5 (a:) ps a - (1 -x)(l - /3) 


in the limit x — * 1 




g5 (x)«(l-/3)-(l-x)(l-/3) 



The intuitive explanation of the limiting cases is quite instructive. Lets 
consider first the x — >• 0. In Bergstrom's model Pb(x) is proportional to x, 
with two terms corresponding to two situations: real cooperator laballed as 
cooperator matches real cooperator laballed as cooperator and real cooper- 
ator laballed as defector matches with real cooperator laballed as defector. 
Taking selfknowledge into account the second type of match is not allowed, 
as selfknowledgeable cooperator would not match with apparent defector, 
thus ps(x) has only one term. 

For q(x) in the limit of small x , Bergstrom's model gives qs{x) ~ x. This 
corresponds to the situation where every defector (either Dc or Do) would 
find a cooperator (respectively Cq and Cd). In the selfknowledge model only 
the defectors posing as cooperators would find willing cooperators, and thus 
q s (x) « x(l -P). 

Despite quantitative differences both model preserve linaerity oip(x) and 
q(x) for small x. For x approaching 1 their behaviour is however drastically 
different. Bergstrom's model predicts both functions to approach 1, so that 
their difference, a(x) is again linear in (1 — x). The selfknowledge model 
gives a different prediction. As x — > 1 ps(x) approaches a. This corresponds 
to an intuition that when there are almost no defectors, only the apparent 
cooperators would find willing partners. Those 'unfortunate' to be labelled 
as defectors, despite their true nature would remain unmatched. For x — ► 1 
we have qs(x) — > 1 — j3. This may be explained as follows: from the small 
number of defectors all cheaters labelled as cooperators would surely find a 
match of a cooperator. More importantly, the linear terms in expansion of 
both]?5(a;) and qs(x) for a; approaching 1 are the same, namely (l—x)*(l—/3). 
They cancel out leaving as(x) ~ a + (3 — 1 + 0(x 2 ). 

4 Comparison of payoffs 

Before we would summarize the results of the discussed models lets introduce 
the payoff functions for cooperators and defectors and their difference S(x). 
This difference would then determine the dynamics of the population. 
The payoff for a cooperator is given by: 



p(x)R + (l-p(x))S 



S + p(x)(R-S) 

S + a(x)(R -S) + x(l - a(x))(R - S). 



(11) 



Similar reasoning gives the payoff for a defector: 



q(x)T + (1 - q(x))P = P + x(l - a(x))(T - P). 



(12) 
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The difference between payoffs is 

8(x) = S-P + a(x)(R -S) + x(l - a(x))[(R + P) - (S + T)}. (13) 

For additive Prisoners Dilemma games (R + P) — (S + T) = and 

5(x) = S -P + a(x)(R- S) = a(x)b-c, (14) 

where we have used the notation from Section 2. In the following we would 
study only such additive games. 

In Bergstrom's model asix) for all values of a and j3 has zero values for 
x = and x = 1. For the range of values considered in [l] i.e. a,/3 > 1/2, 
the index of assortativity asix) is a concave function. It is interesting to 
observe that this property is preserved for all values of a and (5, with the 
exception of situation when a + (3 = l 1 . This allows to extend the results of 
Bergstrom to quite interesting situation of invasion of 'very good cheaters' 
into cooperators population. In such situation x ~ 1, @ < 1/2 and changing 
a does not introduce qualitative impact. As may be seen from Figure 2 
even for j3 < 1/2 the concave character of the a B (x) is preserved. This 
means that the concave character of 5b(x) = ob(x)& — c is also preserved. If 
c > max(a(x)6) then Sb(x) < and proportion of defectors increases, due to 
their higher payoff. There is only one stable point: all players are defectors. 
For c < max(a(x)6) as presented in [l] Sb(x) < for certain X\ < x < x 2 , 
and there are two stable equilibria: x = - all defectors and x = X2 - mixed 
population of defectors and cooperators (see Figure 4). 

In the selfknowledge model the situation is different. For certain values of 
b, c, such as c > max(a(x)6), as before, the proportion of defectors increases, 
due to their higher payoff. There is only one stable point: all players are 
defectors. 

When a + (3 — 1 < Owe have as(x) < for all x. In such a case, 
regardless of the payoff coefficients b and c defectors benefit always surpasses 
cooperators benefit and Ss(x) < for all x. The only equilibrium is at x = 
(see Figure 5). 

When a + j3 — 1 > the assortativity index increases monotonically with 
x, and for c < max(a(x)6) there is a region of x > X\ where 5 six) > 0. Two 
stable equilibria form depending on initial parameters, one at x = and one 
at x = 1. 

Interpretation of the above results based on common sense reasoning is 
as follows. 

For a + (3 <C 1 we have relatively large number of 'mislabelled' play- 
ers, both cooperators and defectors. In such situation cooperators are at 

J For the case of a + (3 = 1 we have asix) = for all x 
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disadvantage: they would not want to match with anyone 'looking like' a 
defector decreasing the number of matches with true cooperators, while still 
being prone to fall for a cheating defector. Defectors, on the other hand, see 
little to lose in matching with presumed defector and willingly match with 
presumed cooperators. Thus the negative value of a(x) and following it the 
greater payoff for defectors. 

For a + /3 ^> 1 there are relatively few mislabelled characters. For large x, 
as most of the small number of defectors look as defectors, the pure strategy 
of cooperators is actually beneficial. There are no matches such as Cd (co- 
operator labelled as defector) willing to match with Dp (correctly labelled 
defector) — and such matches were allowed in Bergstrom model. Thus the 
difference in the predictions of the two models. The defectors matching with 
defectors are at payoff disadvantege to cooperators matching with coopera- 
tors and eventually vanish from population. 

On the other hand, for small x and a+j3 ^> 1 the few cooperators present, 
surrounded by many correctly labelled defectors can match only within their 
group (ax) or with the mislabelled defectors (1 — f3)(l — x) — the other 
situation resulting in severly disadvantageous outcome. Here the only stable 
population is that of defectors only. 

A few words should be said about the validity of the two models for 
a + [3 ~ 1. For both models this choice of parameters yields a(x) ~ 
for all x. Moreover, the function a(x) is very flat, and for a + j3 = 1 this 
function is equal to for all x. the validity of predictions for both models 
are questionable for such case and the dynamics of the population should be 
determined by factors not taken into account. 

4.1 Conclusions 

In this short analysis we have extended the theoretical model proposed by 
Bergstrom, which allowed not only to analyse the results of assortative match- 
ing (i.e. preferable matching of similar players among themselves) but also 
to calculate the assortativity index a(x). The new model takes into ac- 
count influence of the selfknowledge of the players on their choices. It has 
been shown that for certain values of the population composition (defined by 
numbers of cooperators and defectors and proportions of these populations 
'correctly' labelled) the stable points predicted by Bergstrom's model and 
the selfknowledge model differ qualitatively, the latter model leading only to 
'pure' populations of all defectors or all cooperators. 



8 



References 

[1] T. C. Bergstrom The Algebra of Assortative Encounters and the Evolu- 
tion of Cooperation 

[2] J Maynard Smith, G. Price The Logic of Animal Conflicts, Nature 
246:15-18, 1973 

[3] J. Maynard Smith Evolution and the Theory of Games, Cambridge Uni- 
versity Press, 1982 

[4] A. Roth, M. Sotomayor Two-Sided Matching: A Study in Geme- 
Theoretic Modelling and Analysis, Econometric Society Monographs, 
Cambridge University Press, 1982 



9 



Appendix 1: Matching tables for Selfknowledge 
model and for the Bergstrom model 

Tables for the Selfknowledge model 







Cooperator - Cooperator 




with selfknowledge 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


C c 


Cc 


a 2 x 2 


YES 


b-c + k 


b-c + k 


Full agreement of perceived 
and real behaviour 


C c 


C D 


a(l — a)x 2 


NO 






X would not agree to pair 
with Y, because Y looks like 
D 


C D 


Cc 


(1 — a)ax 2 


NO 






Y would not agree to pair with 
X, because X looks like D 


c D 


C D 


(l-a) 2 x 2 


NO 






Neither would agree to pair 
with the other: 'I know I am 
a C, why would I pair with 
someone who looks like D' 







Defector 


- Defector 






with selfknowledge 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


D c 


D c 




YES 








Both would agree with hopes 
of cheating the other 


D D 


D D 


2 (l-x) 2 


YES 








Both would agree on principe 
of 'it would not hurt' 


D D 


D c 




YES 








X would agree with hope of 
cheating Y, Y on principe of 
'it would not hurt' 


D c 


D D 




YES 








Y would agree with hope of 
cheating X, X on principe of 
'it would not hurt' 



Tables for the Selfknowledge model 



10 







Cooperator - 


Defector 






with selfknowledge 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


C c 


D c 


a(l-0)x(l-x) 


YES 


—c 


b 


Both would agree, X 
genuinely, Y as cheater 


D c 


Cc 


a(l-p)x(l-x) 


YES 


b 


—c 


Both would agree, Y 
genuinely, X as cheater 


Cc 


D D 


a(3x(l — x) 


NO 






X would not agree 


D D 


Cc 


a(3x{\ — x) 


NO 






Y would not agree 


C D 


D D 


(1 - a)(3x(l - x) 


NO 






X would not agree 


D D 


C D 


(1 - a)/3x(l - x) 


NO 






Y would not agree 


C D 


D c 


(1 -a)(l -P)x(l -x) 


YES* 


— c 


b 


X would agree perceiv- 
ing Y as C, Y would 
agree in the spirit 'it 
would not hurt' 


D c 


c D 


(l-a)(l-/3)x(l-x) 


YES* 


b 


— c 


Y would agree perceiv- 
ing X as C, X would 
agree in the spirit 'it 
would not hurt' 







Cooperator - Cooperator 




Bergstrom 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


Cc 


C c 




YES 


b-c + k 


b-c + k 


apparent cooperator ONLY 
with apparent cooperator 


Cc 


C D 


a(l — a)x' 2 


NO 






NO mixed labels 


C D 


Cc 


(1 — a)ax' 2 


NO 






NO mixed labels 


C d 


C D 


(1 - afx 1 


YES 


b-c + k 


b-c + k 


apparent defector ONLY with 
apparent defector 







Defector 


- Defector 




Bergstrom 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


D c 


D c 


(l-/3) 2 (l-x) 2 


YES 








apparent cooperator ONLY 
with apparent cooperator 


D D 


D D 


0\\ -xf 


YES 








apparent defector ONLY with 
apparent defector 


D d 


D c 


0(1-P)(1- X )* 


NO 






NO mixed labels 


D c 


D D 


P(l-P)(l- X )* 


NO 






NO mixed labels 



11 







Cooperator - 


Defector 






Bergstrom 


X 


Y 


Fraction 


Consent 


Payoff X 


Payoff Y 


Reasoning 


C c 


D c 


a(l-0)x(l-x) 


YES 


— c 


b 


apparent cooperator 
ONLY with apparent 
cooperator 


D c 


Cc 


a(l-p)x(l-x) 


YES 


b 


— c 


apparent cooperator 
ONLY with apparent 
cooperator 


Cc 


D D 


a@x(l — x) 


NO 






NO mixed labels 


D D 


Cc 


a(5x{l — x) 


NO 






NO mixed labels 


C D 


D D 


(1 - a)Px(l - x) 


YES 


—c 


b 


apparent defector 
ONLY with apparent 
defector 


D D 


c D 


(l-a)0x(l-x) 


YES 


b 


— c 


apparent defector 
ONLY with apparent 
defector 


C D 


D c 


(1 - a)(l - P)x(l - x) 


NO 






NO mixed labels 


D c 


C D 


(1 - a)(l - (3)x(l - x) 


NO 






NO mixed labels 
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Appendix 2: Figures 
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Figure 2: Assortativity index a# for Bergrstom model as a function of a; and 
(5. Values used in calculation: a = 0.8; it is worth noting that for (3 = 0.2 
a B = 0. 
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Figure 3: Assortativity index for Selfknowledge model as a function of x and 
(5. Values used in calculation: a = 0.8; it is worth noting that for (3 = 0.2 
a s = 0. 
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0.2 0.4 0.6 0.8 1 

x 

Figure 4: Example of difference between payoffs 5(x) for Bergstrom model. 
Two stable equilibria at x = and x = x 2 form. Values used in calculation: 
a = 0.8, j3 = 0.8, b = 10, c = 1. Arrows indicate direction of evolution of 
populations. 
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r 

-0.5 - 




0.2 0.4 0.6 0.8 1 

x 

Figure 5: Example of difference between payoffs S(x) for Selfknowledge model 
with a + /3 < 1 . Only one stable equilibrium at x = is present. Values used 
in calculation: a = 0.4, j3 = 0.4, b = 10, c = I. Arrows indicate direction of 
evolution of populations. 
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Figure 6: Example of difference between payoffs S(x) for selfknowledge model 
with a + j3 > 1 . Two stable equilibria at x = and x — 1 are present. 
Values used in calculation: a = 0.8, j3 = 0.8, b = 10, c = 1. Arrows indicate 
direction of evolution of populations. 
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